We apply a method proposed by members of the CTEQ Collaboration to estimate the uncertainty in
associated W-Higgs boson production at Run II of the Tevatron due to our imprecise knowledge of
parton distribution functions (PDFs). We find that the PDF uncertainties for the signal and background
rates are of order 3%. The PDF uncertainties for the important statistical quantities (significance
of the Higgs boson discovery, accuracy of the measurement of the $WH$ cross section) are smaller
(1.5%) due to the strong correlation of the signal and background.

The steady improvement of world hadronic data has stimulated significant interest in quantitative estimates
of theoretical uncertainties due to incomplete knowledge of parton distribution functions (PDFs) [1-7]. The
published studies discuss the impact of these uncertainties on the shape of the PDFs or on simple observables,
such as the total cross section for $W^\pm$ boson production at the Tevatron. It is interesting to apply the
proposed methods to more involved observables, such as cross section distributions or the ratio of the signal
to background in a search for new physics.

To illustrate the new issues that such an application involves, consider how PDF uncertainties affect the
discovery potential for the Standard Model Higgs boson with mass $M_H = 115$ GeV via associated $WH$ production
at the Tevatron. In this process, Higgs bosons can be discovered by observing excess production of $b\bar{b}$ pairs
with invariant mass close to $M_H$, e.g., in the band $95 < M_{b\bar{b}} < 135$ GeV. The $WH$ signal consists of two
tagged $b$-quark jets, a lepton, and missing transverse energy associated with the unobserved neutrino from the $W$
decay. The dominant background process is direct $Wb\bar{b}$ production. In an experimental analysis, this QCD
background would be estimated by extrapolation of the $M_{b\bar{b}}$ distribution from the regions of $M_{b\bar{b}}$
where the $WH$ cross section is negligibly small, e.g., from the side bands $75 < M_{b\bar{b}} < 95$ GeV and
$135 < M_{b\bar{b}} < 155$ GeV.

We model both the signal and background with tree-level matrix element calculations using MADGRAPH
at a hard scattering scale $\mu^2 = \hat{s}$. To simulate the resolution of the hadron calorimeter, we smear the
jet energies with a Gaussian of width $\Delta E_j/E_j = 0.80/\sqrt{E_j} \oplus 0.05$ (added in quadrature). We
simulate the acceptance of the detector by using the selections listed in Table I. An isolation cut is placed on
the lepton, as defined by a cone of radius $\Delta R$. We use the impact-parameter $b$-tagging efficiency function
defined in SHW 2.3.
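
The smearing step described above can be sketched as follows. This is only an illustration, not the code used in the analysis: the function names are hypothetical, the jet energy is assumed to be given in GeV, and the clipping of negative fluctuations at zero is our own assumption, since the text does not say how such cases are handled.

```python
import math
import random


def jet_energy_resolution(energy_gev, stochastic=0.80, constant=0.05):
    """Fractional resolution dE/E = stochastic/sqrt(E) (+) constant, added in quadrature."""
    return math.hypot(stochastic / math.sqrt(energy_gev), constant)


def smear_jet_energy(energy_gev, rng=random):
    """Draw a smeared jet energy from a Gaussian centred on the true energy."""
    sigma = jet_energy_resolution(energy_gev) * energy_gev
    # Assumption: negative fluctuations are clipped at zero.
    return max(0.0, rng.gauss(energy_gev, sigma))


# Example: smear a 50 GeV jet a few times with a fixed seed.
rng = random.Random(12345)
smeared = [smear_jet_energy(50.0, rng) for _ in range(5)]
```

For a 100 GeV jet the two terms are 0.08 and 0.05, so the fractional resolution is about 9.4%; the stochastic term dominates for the jet energies relevant here.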

The PDF uncertainties influence the potential for the discovery of the Higgs boson in several ways. First,
they affect the shape of the $M_{b\bar{b}}$ distribution and, therefore, the accuracy of the extrapolation of the
background from the side bands. Second, the relative errors for statistical quantities, such as the ratio $S/B$ of
the signal and background rates, may differ significantly from the relative errors for $S$ and $B$ if the latter
are correlated or anticorrelated. Third, the errors for $S$ and $B$ are likely to be different in the positive and
negative directions, commonly due to changes in the shape of the distributions $d\sigma/dM_{b\bar{b}}$ under
variation of the PDFs. Hence, the integrated distribution may be less constrained in the positive direction than
in the negative direction.

As a first step in the study of the above issues, we estimate the PDF uncertainties for $S$ and $B$ with the
method proposed by J. Pumplin, D. Stump, Wu-Ki Tung et al. (PST). The PST method is based on
diagonalization of the matrix of second derivatives of $\chi^2$ (the Hessian matrix) near the minimum of $\chi^2$.
Since $\chi^2$ is approximately parabolic near its minimum $\chi_0^2$, hypersurfaces of constant $\chi^2$ are
hyperellipses in the space of the original 16 PDF parameters $\{a_i\}$. By an appropriate change of coordinates
$\{a_i\} \to \{z_i\}$, $i = 1, \ldots, 16$, we can transform the hyperellipses into hyperspheres. We assume that all
acceptable PDF sets correspond to $\chi^2$ that does not exceed its minimal value $\chi_0^2$ by more than $T^2$.
As a result, the acceptable PDF sets have $\{z_i\}$ within a hypersphere of radius $T$ around
$\{z_i(\chi_0^2)\} \equiv \{z_i^0\}$. We present the results for $T = 10$. Our global analysis of
the PDFs uses the same set of hadronic data as in Ref. [7].
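
For orientation, the change of coordinates can be written out explicitly. The following is a sketch in the conventions of the PST method as we understand them; the symbols $H_{ij}$ (Hessian), $\epsilon_k$ (its eigenvalues), and $v_{ik}$ (its orthonormal eigenvectors) are introduced here for illustration and do not appear in the text above.

```latex
\begin{aligned}
\Delta\chi^2 &\equiv \chi^2 - \chi_0^2 \approx \frac{1}{2}\sum_{i,j=1}^{16} H_{ij}\,(a_i - a_i^0)(a_j - a_j^0),
\qquad
H_{ij} = \left.\frac{\partial^2 \chi^2}{\partial a_i\,\partial a_j}\right|_{\chi^2 = \chi_0^2}, \\
\sum_{j} H_{ij}\, v_{jk} &= \epsilon_k\, v_{ik},
\qquad
a_i - a_i^0 = \sum_{k=1}^{16} v_{ik}\,\sqrt{\frac{2}{\epsilon_k}}\;(z_k - z_k^0), \\
\Delta\chi^2 &= \sum_{k=1}^{16} (z_k - z_k^0)^2 \le T^2 .
\end{aligned}
```

With this rescaling every eigenvector direction contributes equally to $\Delta\chi^2$, which is why the tolerance region becomes a hypersphere of radius $T$ in the $z$ coordinates.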

Table I: Cuts used to simulate the acceptance of the detector at the Tevatron Run II.

$|\eta_b| < 2$         $E_{Tb} > 20$ GeV
$|\eta_l| < 1.5$       $E_{Tl} > 20$ GeV
$|\Delta R| > 0.7$     $\not{E}_T > 20$ GeV
The PDF uncertainty for an observable $O$ is the maximal change in $O$ as a function of the variables $\{z_i\}$
varying within the tolerance hypersphere. The PST method estimates this variation from the values of $O$ at points
displaced by $\pm t$ along each direction $z_i$, where $t = 5$ is a small step in the space of $z_i$. For brevity,
$O(z_1^0, \ldots, z_i^0 \pm t, \ldots, z_{16}^0)$ is denoted as $O(z_i^0 \pm t)$.
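
Since the display formula for the symmetric error is not reproduced here, the sketch below assumes the standard linearized Hessian "master formula", $\delta O = \frac{T}{2t}\big[\sum_{i=1}^{16}\big(O(z_i^0+t) - O(z_i^0-t)\big)^2\big]^{1/2}$. That form is our assumption about what the text means, and the function name and the toy numbers are hypothetical.

```python
import math


def symmetric_pdf_error(o_plus, o_minus, tolerance=10.0, step=5.0):
    """Symmetric PDF error from pairs O(z_i^0 + t), O(z_i^0 - t).

    Assumed form: (T / 2t) * sqrt(sum_i (O_i^+ - O_i^-)^2).
    """
    if len(o_plus) != len(o_minus):
        raise ValueError("need one +t and one -t value per eigenvector direction")
    scale = tolerance / (2.0 * step)
    return scale * math.sqrt(sum((p - m) ** 2 for p, m in zip(o_plus, o_minus)))


# Toy example with three directions instead of sixteen.
toy_error = symmetric_pdf_error([10.3, 10.0, 9.9], [9.7, 10.4, 10.1])
```

With $T = 10$ and $t = 5$ the prefactor $T/(2t)$ equals one, so the error is simply the quadrature sum of the differences between the $+t$ and $-t$ values.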



The PDF error of Eq. (1) is a combination of 32 cross sections, each of which is known with some uncertainty due to
the Monte-Carlo integration. One might be concerned about accumulation of Monte-Carlo errors in the calculation
of $\delta O$. Fortunately, this accumulation does not happen, because both the calculation of $\delta O$ in Eq. (1)
and the propagation of Monte-Carlo errors involve only summation in quadrature. The Monte-Carlo error
$\Delta_{MC}\,\delta O$ for $\delta O$ is obtained by propagating the Monte-Carlo errors $\Delta_{MC} O_i$ for $O$
calculated with the PDF set $i$. If all $\Delta_{MC} O_i$ are approximately the same ($\Delta_{MC} O_i \approx \Delta$
for $i = 1, \ldots, 32$), the result simplifies: in this case the Monte-Carlo error for $\delta O$ is proportional to
the Monte-Carlo error for $O_i$ and not to the number of the PDF parameters. In our calculation, the Monte-Carlo
uncertainty of $\delta O$ does not exceed 20% of $\delta O$.
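
The statement that the Monte-Carlo errors do not accumulate can be checked with a short sketch. It assumes the same linearized form of $\delta O$ as the standard Hessian formula, $\delta O = \frac{T}{2t}\big[\sum_i (O_i^+ - O_i^-)^2\big]^{1/2}$, and applies ordinary first-order error propagation to independent Monte-Carlo errors; the function name and the toy inputs are hypothetical.

```python
import math


def mc_error_of_pdf_error(o_plus, o_minus, err_plus, err_minus,
                          tolerance=10.0, step=5.0):
    """First-order Monte-Carlo error on dO = (T/2t) sqrt(sum_i (O_i^+ - O_i^-)^2).

    err_plus[i] and err_minus[i] are the (independent) Monte-Carlo errors
    on O(z_i^0 + t) and O(z_i^0 - t).
    """
    scale = tolerance / (2.0 * step)
    diffs = [p - m for p, m in zip(o_plus, o_minus)]
    norm = math.sqrt(sum(d * d for d in diffs))
    if norm == 0.0:
        raise ValueError("all differences vanish; linear propagation is undefined")
    weighted = math.sqrt(sum(d * d * (ep * ep + em * em)
                             for d, ep, em in zip(diffs, err_plus, err_minus)))
    return scale * weighted / norm


# Toy check with 16 directions and equal Monte-Carlo errors on all 32 values.
n = 16
plus = [100.0 + 0.1 * (i + 1) for i in range(n)]
minus = [100.0 - 0.05 * (i + 1) for i in range(n)]
delta = 0.02
equal_case = mc_error_of_pdf_error(plus, minus, [delta] * n, [delta] * n)
```

With equal errors $\Delta$ the result reduces to $\frac{T}{\sqrt{2}\,t}\,\Delta$, i.e. $\sqrt{2}\,\Delta$ for $T = 10$ and $t = 5$, independent of the number of directions.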

In addition to the symmetric error $\delta O$, it is useful to estimate the maximal variations of $O$ in the
positive and negative directions, $\delta O_+$ and $\delta O_-$. These variations define the true allowed range for
$O$ and may differ significantly from $\delta O$. For instance, $\delta O$ for the total cross section of
$W^\pm$ boson production at the LHC is 5%, while $\delta O_-$ and $\delta O_+$ are $-3\%$ and $+7\%$, respectively.
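
One possible way to compute such asymmetric variations is sketched below. The display formulas are not reproduced in this text, so the prescription used here is an assumption: for each direction, only displacements that raise (lower) $O$ relative to the central value contribute to $\delta O_+$ ($\delta O_-$), and the result is rescaled from the step $t$ to the tolerance $T$. The function name and the toy numbers are hypothetical.

```python
import math


def asymmetric_pdf_errors(o_central, o_plus, o_minus, tolerance=10.0, step=5.0):
    """Return (dO_plus, dO_minus) as non-negative magnitudes.

    Assumed prescription: per direction, keep the largest upward (downward)
    shift among O(z_i^0 + t) and O(z_i^0 - t), or zero if there is none.
    """
    scale = tolerance / step
    up = sum(max(p - o_central, m - o_central, 0.0) ** 2
             for p, m in zip(o_plus, o_minus))
    down = sum(max(o_central - p, o_central - m, 0.0) ** 2
               for p, m in zip(o_plus, o_minus))
    return scale * math.sqrt(up), scale * math.sqrt(down)


# Toy example: the second direction lowers O for both signs of the step,
# so it contributes only to the downward error.
d_up, d_down = asymmetric_pdf_errors(10.0, [10.3, 9.8], [9.7, 9.9])
```

When $O$ is exactly linear in every $z_i$, both values coincide with the symmetric error; they differ once $O$ has sizable curvature along some direction, as in the toy example.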

In associated $WH$ production at the integrated luminosity $\int L\,dt = 15$ fb$^{-1}$, we expect the following
numbers of signal and background events in the signal band $95 < M_{b\bar{b}} < 135$ GeV:

$$S = 49.7^{+\ldots}_{-\ldots}\ (3.0\%), \qquad B = 110.1^{+\ldots}_{-\ldots}\ (3.1\%). \tag{6}$$



The number in parentheses is the symmetric relative error for $S$ or $B$ estimated with the help of Eq. (1).
Common statistical combinations of $S$ and $B$ are

$$S/B = 0.451^{+\ldots}_{-\ldots}\ (1.5\%), \qquad S/\sqrt{B} = 4.73^{+\ldots}_{-\ldots}\ (1.8\%), \tag{7}$$

$$\sqrt{S+B}/S = 0.254^{+\ldots}_{-\ldots}\ (1.7\%). \tag{8}$$



According to Eqs. (6-8), the errors for the statistical quantities are very asymmetric. In the lower
($75 < M_{b\bar{b}} < 95$ GeV) and upper ($135 < M_{b\bar{b}} < 155$ GeV) side bands, we expect
$94.5^{+\ldots}_{-\ldots}$ (4%) and $30^{+\ldots}_{-\ldots}$ (5%) background events, respectively.

Eqs. (6-8) also show that the PDF errors on $S/B$, $S/\sqrt{B}$, and $\sqrt{S+B}/S$ are smaller (1.5%-1.8%) than
the uncertainties on $S$ and $B$ (~3%), which signals a correlation between the PDF errors for $S$ and $B$. We can
get a feeling for this correlation by studying correlations of the individual variations $\delta S_i/S$ and
$\delta B_i/B$ (Fig. 1). On average, the magnitude of $\delta S_i/S$ is larger than the magnitude of the
corresponding $\delta B_i/B$, since the "average ratio" of these magnitudes,
$\frac{1}{16}\sum_{i=1}^{16} \left|(\delta S_i/S)/(\delta B_i/B)\right| = 1.53$, exceeds unity. However, we are more
interested in the correlation of the largest values of $\delta S_i/S$ and $\delta B_i/B$, which give the dominant
contributions to the total relative errors of $S$, $B$, and the statistical quantities. According to Fig. 1, the
largest $\delta S_i/S$ and $\delta B_i/B$ are well correlated, so that their contributions to
$\delta(S/B)/(S/B)$, etc., cancel. As a result, the relative errors for the statistical quantities are smaller than
the relative errors for $S$ and $B$.

Figure 1: Spread of $\delta S_i/S$ (circles) and $\delta B_i/B$ (boxes) for $i = 1, \ldots, 16$.

Figure 2: The total signal plus background $M_{b\bar{b}}$ distribution (solid line) as compared to the PDF
uncertainty band for the background (dashed lines).

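The overall correlation of the vectors $\{\delta S_i/S\}$ and $\{\delta B_i/B\}$ can be quantified by introducing
the cosine of the angle $\varphi$ between these vectors:

$$\cos\varphi = \frac{1}{\delta S\,\delta B}\sum_{i=1}^{16} \delta S_i\,\delta B_i. \tag{9}$$

Then, the relative error for $A = S/B^p$ is

$$\left(\frac{\delta A}{A}\right)^2 = \left(\frac{\delta S}{S}\right)^2 + \left(\frac{\delta B^p}{B^p}\right)^2
- 2\,\frac{\delta S\,\delta B^p}{S\,B^p}\cos\varphi, \tag{10}$$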

and correlated or anticorrelated $S$ and $B$ correspond to $\cos\varphi = 1$ or $-1$, respectively. Similar
correlation angles can be calculated for any pair of relative errors, including the errors for the backgrounds in
the upper and lower side bands. We find that $\cos\varphi$ for the pairs ($S$, $B$) and ($S+B$, $S$) in the signal
band is 0.89 and 0.95, respectively, i.e., the correlation is very good. The correlation cosine between the
background cross sections in the lower and upper side bands is also large (0.62), which indicates that the PDF
uncertainty mostly affects the overall normalization of the $M_{b\bar{b}}$ distribution and not its shape.
Correspondingly, the extrapolation from the side bands accurately approximates the background in the signal band.
Fig. 2 illustrates the relative size of the PDF uncertainties for the background in comparison to the signal plus
background distribution.
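
As a rough numerical cross-check of the correlation formulas, the sketch below evaluates the relative error of $A = S/B^p$ from the rounded values quoted in the text (3.0% for $S$, 3.1% for $B$, $\cos\varphi = 0.89$), using $\delta B^p/B^p = p\,\delta B/B$ for $p > 0$. The function names are hypothetical, and because the inputs are rounded the outputs can only be expected to agree with the quoted errors to within rounding.

```python
import math


def correlation_cosine(ds, db):
    """Cosine of the angle between the vectors {dS_i} and {dB_i}."""
    norm_s = math.sqrt(sum(x * x for x in ds))
    norm_b = math.sqrt(sum(x * x for x in db))
    return sum(s * b for s, b in zip(ds, db)) / (norm_s * norm_b)


def relative_error_of_ratio(rel_s, rel_b, power, cos_phi):
    """Relative error of A = S / B**power for power > 0."""
    rel_bp = power * rel_b  # dB^p / B^p = p * dB / B
    return math.sqrt(rel_s ** 2 + rel_bp ** 2 - 2.0 * rel_s * rel_bp * cos_phi)


# Rounded inputs taken from the text.
err_s_over_b = relative_error_of_ratio(0.030, 0.031, 1.0, 0.89)       # S/B
err_s_over_sqrt_b = relative_error_of_ratio(0.030, 0.031, 0.5, 0.89)  # S/sqrt(B)
```

With these inputs the relative errors come out near 1.4% for $S/B$ and near 1.8% for $S/\sqrt{B}$, compared with roughly 4.3% and 3.4% if $S$ and $B$ were uncorrelated ($\cos\varphi = 0$).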

To conclude, we propose to use asymmetric PDF errors and correlations between PDF errors in detailed studies
of PDF uncertainties. Using these quantities, we find that the cumulative effect of the PDF uncertainties on the
significance for the discovery of the Higgs boson at the Tevatron is not large (~1.5-1.8%). The PDF uncertainties
for the signal and background (~3%) are much smaller than the eventual statistical errors for the measurement
of the $WH$ cross section (> 25%), even if Tevatron Run II accumulates 15 fb$^{-1}$ of integrated luminosity.